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We consider a totally monotone capacity on a Polish space and 
a sequence of bounded p.i.i.d. random variables. We show that, on a 
full set, any cluster point of empirical averages lies between the lower 
and the upper Choquet integrals of the random variables, provided 
either the random variables or the capacity are continuous. 

I 1. Introduction. In this paper we prove a strong law of large numbers for 

' totally monotone capacities. Specifically, given a totally monotone capacity 

v denned on the Borel a-algebra B of a Polish space f2, and a sequence 
{X n } n >i of bounded, pairwise independent and identically distributed ran- 
dom variables, we show that 

i/ljwGfi: / Xx dv < liminf J 
IT)' <limsup J ~ X n ' <- I -X 1 dv\)=l, 
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| provided the X n s are continuous or simple, or v is continuous. In this way 

^) • we extend earlier results of Marinacci [13]. 

Under different names, totally monotone capacities have been widely stud- 



ied in both pure and applied mathematics. They have been introduced by 
Choquet [4] motivated by some problems in potential theory, and in his wake 
many works have studied them in both potential theory and probability the- 
ory (see, e.g., [6] and [8]). 
^ | In mathematical statistics and in mathematical economics, totally mono- 

$^ ' tone capacities have been used to represent subjective prior beliefs when the 

information on which such beliefs are based is not good enough to represent 
them by a standard additive probability (see, e.g., [7, 11, 15, 20] and [19]). 
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Our result shows that even in a nonadditive setting, the limit behavior of 
empirical averages has some noteworthy properties. In particular, we show 
that eventually empirical averages lie, with probability one, between the 
lower and upper Choquet integrals associated with the given capacity. This 
extends to the nonadditive setting the classic Kolmogorov limit law, to which 
our result reduces when v is additive since in this case lower and upper 
Choquet integrals coincide. 

In a subjective probability perspective, our result says that, while in the 
additive case a Bayesian decision maker believes that the empirical averages 
of a sequence of p.i.i.d. random variables tend to a given number, here he 
just believes that the limit behavior of the empirical averages is confined in 
a given interval. This reflects a possible lack of confidence in his probability 
assessments. This interpretation of nonadditive limit laws and its relevance 
in mathematical economics has been recently discussed at length in [9], to 
which we refer the interested reader for details and references. 

2. Preliminaries. Let f2 be a Polish space and B its Borel cr-algebra. 
A random variable (r.v.) is a (Borel) measurable function JT:f2^1R. A 
totally monotone capacity on B is a set function v:B^> [0, 1] such that: 

(c.l) i/(0) = and v(Q) = 1, 
(c.2) v{A) < v{B) for all Borel sets ACB, 
(c.3) v(B n ) I v{B) for all sequences of Borel sets B n [ B, 
(c.4) u(G n ) | v(G) for all sequences of open sets G n j G, 



(c5) u(\JJ =1 Bj) > E^jc { i„.„n}(-l) |J|+ Ma- e j^-) for every collection 



Bi, ... , B n of Borel sets. 

A set function v : B — ► [0, 1] such that: 

(c.6) f{B n ) 1 for all sequences of Borel sets B n j $7, 

is called continuous. A continuous set function v : B — ► [0,1] is a totally mono- 
tone capacity if and only if (c.l), (c.2) and (c.5) hold (see [18] and [14], 
Theorem 10). 

Let v be a totally monotone capacity on B. As in the additive case, we say 
that the elements of a sequence {X n } n >i of r.v.s are pairwise independent 
with respect to v if, for each n, m > 1 and for all open subsets G n , G m of R, 



we say that they are identically distributed if, for each n,m>l and each 
open subset G of R, 



The Choquet integral of a bounded r.v. X with respect to a totally mono- 
tone capacity v is defined by 



v{{X n G G n ,X m € G m }) = u{{X n G G n })v({X m G G m }); 



v(X n eG)=v(X m eG). 
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The integrals on the right-hand side are Riemann integrals and they are 
well defined since v({X > t}) is a monotone function in t. The Choquet 
integral is positively homogeneous, monotone and translation invariant [i.e., 
/ ( X + c) dv = J Xdv + c if c is constant] . It reduces to the standard integral 
when v is an additive probability measure. 

In general, J X dv < —J—Xdv. Equality holds for all r.v.s if and only 
if v is additive. The integrals f X dv and —J— Xdv are sometimes called 
lower and upper Choquet integrals, respectively. 

3. The law of large numbers. We can now state our main result. 

Theorem 1. Let v be a totally monotone capacity on B, and {X n } n >i 
a sequence of bounded, pairwise independent and identically distributed ran- 
dom variables. Then 



i/Mw£fl: / X\ dv < lim inf : 



?? 



< lim sup ^ 1 — — — - < — / — Xi dv 

n n J 

provided at least one the following two conditions holds: 

(i) v is continuous; 

(ii) the random variables X n are either continuous or simple. 

A few remarks are in order. First, as v{B) = 1 and A C B c imply v{A) = 0, 
under the assumptions of Theorem 1 we also have 



v\ \ oj e n : liminf n < I X 1 du \ ) = 



and 



v ( \ to € n : limsup J ~ — J > - / -X\ dv}) =0. 



In other words, with zero probability empirical averages will eventually lie 
outside the interval [J X\ dv, —J—Xi dv]. 

Second, when v is additive we have / X\ dv = —f —X\ dv, and so in this 
case our result reduces to a standard Kolmogorov limit law 

v ({ w e n : hm = j Xx dv ^ = i 

On the other hand, when v is not additive in some cases it may happen (see 
[13]) that 

( f E?=l X 3 (w) Ei=l X 3 (w) i \ 

v\ wGO: lim inf — < lim sup — > = 1. 

VI n n n nil 
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Finally, as anticipated, the closest existing theorem is due to [13]. Our 
result is more general since [13] assumes that f2 is compact, v is continuous, 
the r.v.s X n are continuous, independent and that they satisfy some further 
technical conditions. Moreover, the proof we provide is different and much 
simpler. In fact, here we develop a technique that relies on the relations 
between totally monotone capacities and correspondences, thus making it 
possible to use existing laws of large numbers for correspondences. This 
approach might be useful in establishing further generalizations of limit laws 
to the framework of capacities. This will be the object of future research, 
along with the possibility of weakening some of the continuity conditions 
assumed in Theorem 1. 



4. Proof and related material. Denote by /Co (resp. £/o) the class of 
all nonempty compact subsets (resp. open subsets) of £1; for the sake of 
completeness, write Bq instead of B. If d is a Polish metric on f2, then /Co 
is a Polish space when endowed with the Hausdorff metric 

d'uiK, L) = maxf maxmin d(k, I), maxmin d(L k) } . 

\k£K l&L l&L k&K J 

The Borel cr-algebra on /Co is also generated by the class {K G /Co : K C 
G}ceg n - 

4.1. Measurable correspondences and totally monotone capacities. Let 
(I,C, A) be a nonatomic and complete probability space. A (compact valued) 
correspondence F:I =$Q is a map with domain I and whose values are 
nonempty compact subsets of f2. For any A C f2, we put 

F_ l {A) = {seI:F(s)CA}. 

A correspondence F:I=$£l is measurable if G C for every G € Qn- 

As well known (see, e.g., [12]), the following facts are equivalent: 

• F is measurable; 

• F_i(B) G C for every B G B n ; 

• F is measurable as a function F : I — > /Co • 

When a measurable correspondence F is regarded as a measurable func- 
tion F .1 — > /Co, we denote by F~ 1 its standard inverse image, that is, 

F- 1 {£) = {seF.F(s)e£} V£C/Co, 

and by cr(F) the cr-algebra generated by F, that is, 

a(F) = {F- 1 (V):VeB ICn }. 



In the sequel we will need the next lemma, whose standard proof is omitted. 
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Lemma 2. Let F:I^Q be a measurable correspondence. Then 
a(F)=o-({F_ l (G):G£g n }) 
and {F-i(G) :G £ Qq} is a ir-class containing I. 

A measurable function / : / — > induces a probability distribution P* on 
denned by 

P f (B) = X(r\B)) VI?e£ n . 

In a similar way, a measurable correspondence F :I ^ Q induces a lower 
distribution vp on % denned by 

v F (B) = X(F_ x {B)) VSeBsi. 

The next result, which links totally monotone capacities and lower distribu- 
tions, is essentially due to Choquet [4] (see also [15, 16] and [3]). 

Lemma 3. A set function v.Bq — > [0,1] is a totally monotone capacity 
if and only if there exists a measurable correspondence F : I such that 

v = Vp. 

A measurable selection of a correspondence F : I z4 is a measurable 
function / : I — ► O such that f(s) € i*Xs) for almost all s E I. The set of all 
measurable selections of F is denoted by Sel-F. The Aumann integral (see 
[2] ) of a correspondence F :I with respect to A is defined by 

J F dX = I J f dX : / e Sel F and / integrable J . 

If X : — ► K is continuous or simple, and F is a correspondence, then 
(A" o F)(s) = X(F(s)) is a correspondence [i.e., X(F(s)) € /Or for all s£ J]. 
Moreover, since 

(IoF)_ 1 (i) = F_ 1 (J- 1 (i)) ViCR, 
X o F is measurable provided X and -F are measurable. 

Lemma 4. Lei F :I be a measurable correspondence, and X : — > M 
6e either bounded and continuous or simple and measurable. Then, 



J(XoF) dX 



Xdvp,— / —Xdvp 



This is an immediate consequence of [3], Theorem 4.1. 
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4.2. Proof of Theorem 1. Suppose first that (ii) holds. By Lemma 3, 
there exists a measurable correspondence F :I such that v = vp. 

Next we show that the measurable correspondences {J n oF} n >i are pair- 
wise independent and identically distributed when regarded as measurable 
functions X n oF:J-> /Qr. 

Let n, m > 1 and G n , G m £ <5jr, then 



X((X n o F)_x(G n ) n (X m o F)_i(G m )) 

= A( J F_ 1 (AT- 1 (G„))n J F_ 1 (X- 1 (G m ))) 

= A(F_ 1 (4 1 (G n )ni; 1 (G m ))) 
= ^(x- 1 (G n ,)nx- 1 (G m )) 

= ^(X- 1 (G„))K^ 1 (G' m )) 

= A(F_ 1 (X- 1 (G n )))A(F_ 1 (X- 1 (G m ))) 

= A((X„ o F)_!(G re ))A((X m o F)_x(G m )). 



This proves pairwise independence, since for all j = n, m, {(Xj oF)_i (G)}cegi 
is a 7r-class containing I and generating the a-algebra o~(Xj oF) (see Lemma 



Moreover, for each n,m> 1, and each open subset G € On, 



A((I„oF)^({F G/Cr:KCG})) 

= A({A n oFe {A' € /Cr : K C G}}) 
= A((X n oF)„ 1 (G)) 
= A(F_ 1 (X" 1 (G))) 

= "tf(G)) 
= i/(X" 1 (G)) 

= A((X m o F)-\{K elCm:KCG})). 



This proves identical distribution since {K £ JCr : if C G}Ge5 R is a 7r-class 
containing /Cr and generating B;c m - 

Clearly, for each n > 1 and each h 6 SelA n of, f hdX is finite (/i is 
bounded); moreover, by Lemma 4, f X n o F d\€ /Cr. 

In sum, {X n oF} n >i are pairwise independent and identically distributed 
measurable correspondences with J X n o F dX € /Cjr for all n > 1. A general- 
ization due to [10] (see also [5] and [17]) of a result of Artstein and Vitale 
[1] guarantees that 



2). 
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By Lemma 4, 



Jx l0 Fd\= J Xi dv, - J -X x dv 



Let a n (uj) = — J —— and set 



5 1 = ^seI:^J2X j (F(s))^ J Xidv,- J -Xidv j, 

52 = |s £ I ■ J X\ dv < liminf a n (oj) 

< limsupa n (io>) < — J —X\ dv V u € i^(s)|, 

0,2 = |w G : J X\ dv < liminf a n (u) < limsupa n (u;) < — J —X\ dv^. 

We want to show that v(Q,2) = 1. Notice that 

v(VL 2 ) = \({s £ / : F(s) C 2 }) = A(S 2 ). 
The next claim will be used to show that S\ C S^- 



Claim 1. Lei {-Kn} fre a sequence in /Cr stic/i i/ia£ -fT n — > [a, (3]. Then, 

a < lim inf k n < lim sup k n < (3 

n n 

for each sequence {k n } in M. such that k n £ K n for all n > 1. 

Proof. By definition of Hausdorff metric, -ftT n converges to if and 
only if max(max t „ tf „ mm r£[a ^ \t n - r|,max re [ Q , )i3] min inGjft ; n |r - i n |) -> 0, in 
particular 

(1) max min \t n — r\—>0. 

Let {k n .} be a subsequence of {A; n } such that k nj — > I G [—00, +00]. If ^ ^ 
[a,/3], then there exists e > such that eventually |fc n . — r| > e for all r € 
Hence, we have eventually min rg [ Q m \k n . — r \ > e, thus contradicting 
(1). □ 

If a € Si, then ± £™ =1 X,-(F(a)) -> [[Xi di/, -/ -Xi di/]. Hence, for all 
cu € we have a n (cu) = I E"=i G s £"=1 ^( F ( s )); b y Claim 

/ dv < liminf a n (uj) < limsupa n (u;) < — / — Xi <iiA 
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Therefore, Si C S 2 and so ^(^2) = ^{$2) > ^(Si) = 1- This completes the 
proof of the result when (ii) holds. 

As to (i), denoting by r the Polish topology on £1, there exists a Polish 
topology t* D t on $7 such that <t(t*) = Bq, and such that all the X n s 
are r*-continuous (see, e.g., [21]). Since v is continuous, then it is a totally 
monotone capacity with respect to the topology r* ; we can thus assume that 
(ii) holds. 

Acknowledgments. We thank Steve Lalley (the Editor) and an anony- 
mous referee for helpful suggestions. 
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